Speech quality improvement in TTS system using ABS/OLA sinusoidal model

Authors

Jae-Hyun Bae

Heo-Jin Byeon

Yung-Hwan Oh

Abstract

In this paper, we propose a novel unit concatenation and synthesis method using the ABS/OLA sinusoidal model. Phase succession is used in unit synthesis, assuming that the pitch onset time of the first frame in a given unit is the frame center. In unit concatenation, phase succession and interpolation of the sinusoid amplitudes over several frames around the concatenation point are used. Applying this method to a Text-to-Speech (TTS) system, we obtained speech samples that were more intelligible and natural than those produced by the conventional method.
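The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, under stated assumptions: phase succession is taken to mean advancing each sinusoid's phase by its frequency times the frame hop, and amplitude interpolation is taken to be linear across the frames around the joint. The function names, the linear weighting, and the fixed hop are hypothetical and are not taken from the paper.

```python
import math


def advance_phase(phase, freq_hz, hop_samples, sample_rate):
    """Propagate one sinusoid's phase across a frame hop (assumed rule).

    The result is wrapped to the interval [-pi, pi).
    """
    new_phase = phase + 2.0 * math.pi * freq_hz * hop_samples / sample_rate
    return (new_phase + math.pi) % (2.0 * math.pi) - math.pi


def interpolate_amplitudes(left_amps, right_amps, n_frames):
    """Linearly blend per-sinusoid amplitudes over n_frames transition frames.

    left_amps / right_amps are the amplitude vectors on either side of the
    concatenation point (hypothetical inputs; they must have equal length).
    """
    if len(left_amps) != len(right_amps):
        raise ValueError("amplitude vectors must have the same length")
    frames = []
    for k in range(1, n_frames + 1):
        w = k / (n_frames + 1)  # weight moves from the left unit to the right
        frames.append([(1.0 - w) * a + w * b
                       for a, b in zip(left_amps, right_amps)])
    return frames
```

For instance, `interpolate_amplitudes([1.0, 0.0], [0.0, 1.0], 1)` yields a single transition frame halfway between the two units. A real system would also need to match sinusoids across units before blending, which this sketch does not attempt.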

Similar resources

An enhanced ABS/OLA sinusoidal model for waveform synthesis in TTS

This paper describes a method for text-to-speech waveform synthesis based on the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal model. This model has been shown in previous work to be a useful framework for pitch and time-scale modification of both speech and music signals. This paper explores some extensions of the original ABS/OLA formulation that attempt to overcome specific artifacts,...

Practical high-quality speech and voice synthesis using fixed frame rate ABS/OLA sinusoidal modeling

This paper describes algorithms developed to apply the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal modeling system to real-time speech and singing voice synthesis. As originally proposed, the ABS/OLA system is limited to unidirectional time-scaling, and relies on variable frame length to accomplish time-scale modification. For speech and voice synthesis applications, unidirectional ti...

Speech concatenation and synthesis using an overlap-add sinusoidal model

In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal model [1, 2, 3], which is capable of performing high-quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques,...

Sinusoidal model parameterization for HMM-based TTS system

A sinusoidal representation of speech is an alternative to the source-filter model. It is widely used in speech coding and unit-selection TTS, but is less common in statistical TTS frameworks. In this work we utilize Regularized Cepstral Coefficients (RCC) estimated in mel-frequency scale for amplitude spectrum envelope modeling within an HMM-based TTS platform. Improved subjective quality for ...
